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Abstract. We investigate the size distribution of the maximum areas and the instantaneous distribution of areas 
of sunspot groups using the Greenwich sunspot group record spanning the interval 1874 - 1976. Both distributions 
are found to be well described by log-normal functions. Using a simple model we can transform the maximum 
area distribution into the instantaneous area distribution if the sunspot area decay rates are also distributed log- 
normally. For single- valued decay rates the resulting s napshot distrib ution is incompatible with the obs ervations. 
The current analysis therefore supports the results of iHowardl lll992l) and iMartmez Pillet et all il993h . It is not 
possible to distinguish between a linear and a quadratic decay law, however, with the employed data set. 
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1. Introduction 

Sunspots appear dark on the solar surface and typically 
last for several days, although very large ones may live for 
several weeks. Sunspots are concentrations of magnetic 
flux, with kG magnetic field strengths. Usually, sunspots 
come in groups containing two sets of spots of opposite 
magnetic polarity. 

The size spectrum of sunspots ranges from 3 MSH (micro 
solar hemispheres) for the smallest l|Brav fc Loughheadl 
Il964j) to more than 3 000 MSH for very large sunspots. 
A quantitative study of the size distribution of s unspot 
umbrae has been presented bv lBogdan et alJ l|l988h . They 
found a log-normal size distribution by analysing a dataset 
of more than 24000 Sunspot umbral areas determined 
from Mt. Wilson white-light images. Since the ratio of um- 
bral to penumbral area depends only very slightl y on the 
sunsp ot size (see the references and discussion in SolankiL 
|2003|) such a distribution can be expec t ed to be valid 
for sunspots as a whole. iBogdan et alJ l)l988j) used all 
sunspot observations in their sample to determine their 
size distribution. Since many sunspots live multiple days, 
the same sunspot appears multiple times in their statis- 
tics. Furthermore, in the course of its evol ution, the size 
of a su nspot changes. Hence the method of IBogdan et alJ 
l)l988(l provides the instantaneous distribution of sunspot 
sizes at any given time. This, however, does not in gen- 
eral correspond to the initial size distribution of sunspots, 
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i.e. the distribution of freshly formed sunspots. This is ex- 
pected to be very similar to the distribution of the max- 
imum sizes of sunspots, given that sunspots grow very 
fast and decay slowly. For many purposes, however, the 
latter distribution is the more useful one. An example is 
when the total amount of magnetic flux appearing on the 
solar surface in the form of sunspots needs to be esti- 
mated (since the field stre ngth averaged over a full su nspot 
is remarkably constant (Solanki & Schmi jU E993), the 
sunspot area is a good measure of the total magnetic flux). 
The purpose of this paper is to determine the distributions 
of both, the instantaneous sizes and the maximum sizes, 
and to compare these with each other. We determine the 
size distribution function of sunspot umbrae and of to- 
tal sunspot areas from the digitized version of the daily 
sunspot observations of the Royal Greenwich Observatory 
(RGO). 



2. Dataset and analysis procedure 

The GPR (Greenwich Photohcliographic Results) provide 
the longest and most complete record of sunspot areas, 
spanning observations from May 1874 to the end of 1976. 
However, only the areas of complete sunspot groups and 
not of individual sunspots have been recorded. The area 
covered by the sunspots of a group is measured every time 
it is observed, i.e. every day. Besides employing these val- 
ues we followed each sunspot group until it reached its 
maximum area. This area was stored separately. We em- 
ploy in all cases true areas corrected for projection effects. 
These stored areas can now be used to derive two dif- 
ferent distributions of sunspot areas. If we simply form 
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Fig. 1. Size distribution function of umbral areas obtained 
from the maximum development method (circles) and 
snapshot method (crosses). The log-normal fits are over- 
plotted (solid line: Fit to maximum area distribution, dot- 
ted line: Fit to snapshot distribution). The vertical line 
indicates the smallest umbral area considered for the fits. 
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Fig. 2. Size distribution function of the total spot group 
areas (umbra+penumbra) obtained from the maximum 
development method (circles) and the snapshot method 
(crosses). Overplotted are the log- normal fits for A > 
60 MSH (solid line: Maximum development method, dot- 
ted line: Snapshot method). 



the distribution obtained from all the measured areas, 
we obtain the average distribution of sunspot sizes at 
any random instance. We call this the snapshot distribu- 
tio n. The snapsh o t dist ribution also underlies the study 
of iBogd an et al.l l)l988j) . In general, this instantaneous 
size of a sunspot group will be smaller than the size of 
the sunspot group at its full development. In the second 
method, hereafter called maximum development method, 
the area of a sunspot group is taken at the time when the 
group has reached its maximum area. The maximum size 
is usually reached early in the development of a sunspot or 
sunsp ot group. It is followed by a steady decay I Mcintosh! 

The maximum group area A$ determined from the 
Greenwich data is in general too small. Since only one 
observation per day is available and thus the maximum 
area of the spot group can be reached several hours before 
or after the measurement. As we consider spot groups, 
the different spots in the group may reach their maximum 
area at different times. Therefore, Aq is in general some- 
what smaller than the sum of the maximum areas of all 
the sunspots in the group. The area distribution of indi- 
vidual sunspots can be partly estimated by considering 
separately just groups of type 0, i.e. those containing just 
a single spot. 

Also, visibility and proje ction effects lead to too small 
areas in the observations l)Kopeckv et all Il985^) affecting 
both distributions. The RGO dataset that we use is al- 
ready corrected for foreshortening. Nevertheless, in order 
to minimize the errors resulting from visibility corrections 
we use only spot groups measured within ±30 ° from the 
central meridian. When determining the maximum area of 
a sunspot group, we make sure that the maximum extent 
is reached within a longitude ±30 ° although the sunspot 
group does not necessarily have to be born within this an- 



gle. 

We replace the continuous size distribution function 
dN/dA by the discrete approximation AN/ A A, where AA 
is the bin width and AN is the raw count of the bin. Our 
criterion for the bin width is 20 % of the geometric mean 
area of the bin. We include in our analysis only sunspot 
groups whose areas exceed a lower cut-off limit A m i n . For 

umbral areas we set t he limit to ^4"" b = 15 MSH. This is 

I > """ I i I 

Martinez Pillet et alJ ljI993j) , which 

they imposed when analyzing the same data set. For total 
spot areas we set the cut-off limit to A\°\ n = 60 MSH. 
Smaller areas than A m { n are not taken into account in 
this study, as they are falsified from enhanced intrinsic 
measurement errors as well as from distortions due to at- 
mospheric seeing. 

In order to make the size distributions for different 
datasets comparable, we divide AN/AA by the total num- 
ber of spots exceeding A m - m . This corresponds to a nor- 
malization 

f°° dN 

Finally, we fit each empirical dis tribution with an ana lyt- 
ical function. In agreement with iBoedan et all l)l988|) we 
find that a log-normal function, i.e. a continuous distribu- 
tion in which the logarithm of a variable has a normal dis- 
tribution, provides a good description. The general form 
of a log-normal distribution is 



In 



dN 

~dX 



(\nA-\n(A)f 
2 In a a 
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dN 
~dA 



(2) 



where (diV/dA) max is the maximum value reached by the 
distribution, (^4) is the mean area and a a is a measure of 
the width of the log-normal distribution. Note that a log- 
normal function appears as a parabola in a log-log plot. 
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Fig. 3. Maximum area distribution (circles) and snapshot 
distribution (crosses) of total spot areas for single spots. 
Fits to the data for A > 60 MSH: maximum development 
method (solid line), snapshot method (dotted line). 
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Fig. 4. Snapshot distribution of umbral spot areas for sin- 
gle spots ( crosses), fit to the d ata (dotted line) and the 
curve from lBogdan et all (^988) (solid line). 



Log-normal distributions have been found in various fields 
of natural sciences. Examples are the size of silver par- 
ticles in a photographic emulsion, the survival time of 
bacteria in disinfectants or aer osols in industrial atmo- 
spheres ijCrow fc ShimizuLll988j) . or, within solar physics, 
the distribution of EUV radiances in the quiet Sun 
iPamuhnetallE)?^. 



3. Results for RGO spot group areas 

3.1. Umbrae 

The size distributions of the umbral areas obtained from 
both, the snapshot method and the maximum develop- 
ment method, are shown in Fig.^ For both methods, the 
resulting size distribution is well described by a log-normal 
function above the lower cut-off A m - ln . As one would ex- 
pect, the curve of the maximum areas lies above the snap- 
shot curve for large sunspots. For smaller areas, the snap- 
shot distribution is higher, resulting from the fact that 
the areas obtained with the snapshot method are smaller 
(since they include sunspots at different stages of decay), 
thus leading to more counts for smaller areas. The fit pa- 
rameters are listed in Table 1. It is at first sight surprising 
that the size distributions obtained by both methods do 
not differ by a larger amount than suggested by Fig. ^ 
In general, the two distributions are expected to be more 
similar to each other if the lifetime of sunspots approaches 
the sampling time of the data, i.e. 1 day. For sunspots 
with shorter lifetimes both methods should give identical 
results. Therefore, the small difference between the two 
distributions is consistent with a relatively short average 
lifetime of sunspots. 

The umbral areas for single spots from RGO are roughly 
a factor of 2 — 3 larger than the corresponding areas from 
the Mt. Wilson white light plate collection. This differ- 
ence probably is largely due to the fact that the RGO 



areas are s unspot group areas w hile the Mt. Wilson data 
analysed bv lBogdan et al.1 l|l988j) give the areas of individ- 
ual spots. However, since there are systematic differences 
also between the to tal areas of all the spots on a given da y 
between data sets l|Fligge fc SolankiL Il997t iFosterl 120041) , 
other systematic differences are also likely to be present. 
Systematic differences lead to a shift of the RGO area dis- 
tribution towards higher values of (A) and smaller values 
of a a (Table 1) with respect to the Mt. Wilson dataset. 
The smaller value of a a results from the logarithmic na- 
ture of the distribution. 



3.2. Total areas 

Fig. |2 shows the distributions for the total spot areas, 
i.e. the sum of umbral and penumbral area. Log-normal 
fits match both distributions rather well above the cut- 
off. However, both distributions differ even less from each 
other than when only the umbrae are considered (Fig. ^) . 
Especially in the large area regime, both distributions are 
almost indistinguishable. Since every sunspot must have 
an umbra, it is not clear why the difference between the 
two distributions in Figgis smaller than in Fig.fJ] unless 
it is an indication of the limits of the accuracy of the 
data. It may also be indicating that the decay law may be 
different for umbrae and sunspots as a whole. 

3.3. Total area of single spots 

In this part of the study, we extracted only Greenwich 
sunspot groups of type 0, i.e. single spots (Fig. |3J). In or- 
der to get a statistically significant dataset, we had to 
extend our longitudinal constraints to ±60° around disk 
center. 

The difference between the snapshot and the maximum 
area distribution is more pronounced for total areas of 
single spots than for total areas of all sunspot groups. 
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The difference in the two distributions can be explained 
by a similar argument as in Sect. I5TT1 The maximum dis- 
tribution dominates for large areas, whereas the snapshot 
distribution shows more counts for smaller areas due to 
the inclusion of different decay stages of the sunspots. The 
similarity between Figs^and^Jsuggests that the problem 
lies with Fig.0 It may be that when determining the total 
area of sunspot groups, areas of the generally short-lived 
pores were included in the Greenwich data set. 

3.4. Umbral areas of single spots 

Of special interest is the snapshot distribution of umbral 
areas of single spots (Fig. b ecause this can directly be 
compared to the results of lBogdan et alJ l|l988j) . The RGO 
dataset displays a significantly flatter distribution than 
the Mt. Wilson data, i.e. the ratio of large umbrae to 
small umbrae is bigger for the RGO data. This system- 
atic difference between the data sets is an indication of a 
systematic difference between sunspots in groups of type 
and other spots. The parameter (A) is roughly a factor 
of 2 smaller than in the corresponding Mt. Wilson data, 
while the width of the distribution is larger. 

4. Modeling the snapshot distribution 

4.1. Model description 

We have developed a simple sunspot decay model that 
simulates the snapshot distribution resulting from a given 
maximum area distribution. One aim of this modelling 
effort is to find out to what extend it is possible to dis- 
tinguish between decay laws from the difference between 
the maximum area and the snapshot area distributions. 
Another aim is to test if, with decay laws as published in 
the literature, both the maximum and snapshot area dis- 
tributions must have the same functional form (e.g. both 
be log-normally distributed). 

We consider two kinds of maximum development distri- 
butions: a lognormal distribution J3J and a power-law dis- 
tribution of the general form 

h(A) = vA w . (3) 

The latter is inspired by the power-law distribution of so- 
lar bipolar magnetic regions, i.e. acti ve and ephemeral ac- 
tive regions l|Harvev & ZwaanL 

We assume an emergence rate of 10 000 spots per day. 
The absolute number of emerging spots does not influ- 
ence the results as they are normalized (Eq. ^ and this 
high number is chosen in order to obtain statistically sig- 
nificant distributions. The constant emergence rate is a 
reasonable approximation of the solar case during a small 
period of time, i.e a few months, which is the length of 
time over which we let the model run. 
Once the spots have emerged they begin to decay immedi- 
ately (the form ation time of spots is short, i.e. hours (e.g. 
ISolankil |2008h . and is thus neglected in the model). 



Table 1. Overview of the log-normal fit parameters. Due 
to the normalization Q there are only two free parameters 
(A) and a a- 



Data Set Method (A) a A No. of Fig. 

Sunspots 
or Groups 



Mt. Wilson 


Bogdan 


0.62 


3.80 


24 615 




Umbrae 


et al. 










Umbrae 


Max. dev. 


11.8 


2.55 


3 966 


1 


Umbrae 


Snapshot 


ion 
iz.U 


Z.Z4 


Ol 411 


1 
1 


Total area 


Max. dev. 


62.2 


2.45 


3 926 


2 


Total area 


Snapshot 


OO. O 


z.4y 


9 A ££0 
04 OOZ 


z 


Total area 


Max. dev. 


45.5 


2.11 


939 


3 


single spots 












Total area 


Snapshot 


30.2 


2.14 


15203 


3 


single spots 












Umbral area 


Snapshot 


0.27 


6.19 


11312 


4 


single spots 












Model 


Max. dev. 
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Model 


Snapshot 












hourly 


7.77 


2.80 


21352 828 


5 a 




daily 


8.67 


2.73 


1 092 295 


5 a 




3 days 


9.89 


2.69 


525 605 


5 a 



There has been considerable debate regarding the de- 
cay law of sunspots. A number of authors have ar- 
gued fo r a linear decay of sunspot areas with t ime (e.g. 
iBumbal. |l9t^3j:|Moreno-Insert;is Yazquezl. Il988'). Others, 
e.g. lPetrovav fc van Driel-Gesztelvil l|1997|) . found that the 
decay rate of a sunspot is related to its radius and thus 
is parabolic. The quadratic decay is also favored by mod- 
els that explain the erosio n of a sunspot as m agnetic flux 
loss at the spot boundary l|Mever et all ll 973) . Still others 
could not distinguish between a line ar and a quadratic de- 
cay b a sed on the availa ble da ta fe.gJMarti'nez Pillet et all 
Il993|) . iHowardl l|l992|) and iMartmez Pillet et alJ (|l993> 
found that the sunspot decay rates are log-normally dis- 
tributed. In view of the partly controversial situation we 
have computed models with all 4 possible combinations: 
a) quadratic decay law with log-normally distributed de- 
cay rates, b) quadratic decay law with a single, universal 
decay rate, c) linear decay law with a log-normal decay 
rate distribution and d) linear decay law with a constant 
decay rate. The parabolic decay law we implement has the 
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(t - to) 



with the added condition A(t — t > A /D) 
employed linear decay law has the form 

A(t) =A -D(t-t ), 



(4) 



0. The 



(5) 



with A(t — to > Ao/D) = 0. The decay rates D are ei- 
ther given the same specified value for all sunspots in the 
modelled sample, or are obtained from a random num- 
ber generator providing a log-normal distribution with 
a mean u = 1.75 and a variance a 2 = 2 following 
iMartmez Fillet et all l|l993[) . 

Combining the maximum area distribution with the decay 
law (Eq. 01 or |SJ we can determine the resulting snapshot 
distribution, which can then be compared with the ob- 
served distribution. We simulate an interval of 100 days 
after an initialization time of 100 days in order to make 
sure that a reasonable mix of old, partly decayed spots and 
newly emerged spots is present. We take the fit parameters 
for the umbral maximum development distribution from 
Sect. |3 as the starting distribution of our model. 

4.2. Results from the model 

Snapshot distributions resulting from a quadratic decay 
law of the form Eq. (0J with log-normally distributed 
decay rates are plotted in Fig.Ela for 3 different sampling 
times. The first result is that the snapshot distributions 
can also be fitted well by log-normal functions. A sam- 
pling rate of 1 day corresponds to the RGO dataset and 
thus can be compared with the results for umbral areas in 
Fig-H The modelled snapshot distribution matches quite 
well the observed snapshot distribution above the cut-off 
limit. At a sampling rate of 3 days, both distributions, 
maximum development and snapshot, lie very close 
together. Such a large observing interval is comparable 
to the average lifetime of the spots, so that it becomes 
difficult to distinguish between the two distributions. For 
an observing frequency of 1 hour, a sampling frequency 
provided by the MDI dataset, the difference between the 
distributions is somewhat larger, as more decay stages 
of the spots are included in the snapshot data. When 
considering such a short sampling interval the formation 
time of the spot group becomes important and has to be 
taken into account, which is not included in our model. 
In the next step, we replace the log-normally distributed 
decay-rates in Eq. (0} by constant decay rates (Fig.[5]b). 
It is interesting that for all constant decay rates the 
snapshot distribution curves lie above the maximum 
area distribution for large sunspot areas. At first sight 
this appears counter-intuitive: how can the snapshot 
distribution show more large spots than the distribution 
of spot areas at maximum development? The answer 
lies in the normalization. For a single decay rate, small 
sunspots decay uniformely, so that after a given time 
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Fig. 5. Results from the model for a quadratic decay-law 
for (a) log-normally distributed decay rates and sampling 
times of 1 hour, 1 day and 3 days and (b) for constant 
decay rates D = 5 MSH/day, D — 20 MSH/day and D = 
50 MSH/day and a sampling time of 1 day. 

a certain fraction has become smaller than the cut-off 
area and the distribution is therefore skewed towards 
larger spots, whose relative (but not absolute) numbers 
increase. The reason therefore is the normalization of the 
distributions. For a high decay rate (e.g. 50 MSH/day) 
both distribution curves lie closer together than for small 
decay rates (e.g. 5 MSH/day). This is understandable 
because a small decay rate affects more the smaller spots 
than the larger spots. 

In order to see how the decay law affects the results, 
we repeat the above exercise, but for a linear decay law 
(Fig. [fj) . Qualitatively, a similar behaviour for both cases 
can be observed as in the case of a quadratic decay-law, 
e.g. for constant decay rates the snapshot distributions lie 
above the maximum area curve. When using log-normally 
distributed decay rates in the linear decay law (Eq.^J, the 
resulting snapshot curves for the three different sampling 
times are almost indistinguishable. We conclude from our 
model that it is not possible to distinguish between a 
linear and a quadratic decay-law by this analysis based 
on the Greenwich data. 
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Fig. 6. Results from the model for a linear decay-law 
for (a) log-normally distributed decay rates and sampling 
times of 1 hour, 1 day and 3 days and (b) for constant 
decay rates D = 5 MSH/day D = 20 MSH/day and 
D = 50 MSH/day and a sampling time of 1 day 



A variability of the decay rates (log-normal distribution) 
thus seems necessary to yield the generally observed 
behaviour that the maximum area curve in general lies 
above the snapshot curve. 

Finally, we check if a power-law distribution of the maxi- 
mum development areas could also lead to a log-normal 
snapshot distribution. A power-law si ze distribution with 
an exponent —2 has been found by lHarvevI |l993) for 
active regions using Kitt Peak magnetograms. Since 
active regions harbour sunspots, it might be worth 
testing if the maximum area distribution is similar to 
or very different from that of the host active regions. 
To this purpose we insert a maximum size distribution 
dN/dA ~ A~ 2 in our model. This does not yield a 
log-normal snapshot distribution but rather something 
very close to a power-law, irrespective of the decay law. 
To make sure that this result is not an artefact of the 
special choice of the exponent of the power-law, we ran 
the same simulations with powers between —1.0 and 
—3.0. In all cases we can exclude a transformation of 
the power-law distribution for the maximum areas into a 



log-normal snapshot distribution. 



5. Conclusion 

The size distribution for both, umbral and total spot area, 
has a pronounced, smooth log-normal shape above our 
lower cut-off limit. This is true for both, the instantaneous 
distribution of sunspot sizes (snapshot distribution) and 
for the distribution of sizes at the time of maximum de- 
velopment of each sunspot group. These two distributions 
are rather similar, with the snapshot distribution being 
slightly steeper, in general. 

We have studied what can be learnt about sunspot de- 
cay from the comparison of these distributions, by start- 
ing from the maximum development size distribution and 
computing the snapshot distribution for different decay 
laws and parameters. 

Both, linear and quadratic decay laws, yield qualitatively 
similar results, making it impossible to distinguish be- 
tween them by an analysis, as carried out here. A uni- 
versal decay rate of all sunspots turns out to be incon- 
sistent with the observations, while a log-normal dist ribu- 
tion of decay rates, as post ulated by iHowardl l)l992j) and 
iMartmez Pillet et al.l l|l993|) reproduces the observations. 
The analysis presented here can be improved with obser- 
vational data that a) sample individual sunspots instead 
of sunspot groups, b) are observed at a higher cadence 
(e.g. hourly instead of daily) and c) are obtained for a 
homogeneous, time-independent spatial resolution. Space 
based imagers, such as the Michelson Doppler Imager 
(MDI) on the Solar a nd Hcliospheric Observatory (SOHO) 
( Scherrer et all Il995|) can provide such data. 
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